Behavior coordination for a mobile robot using modular reinforcement learning
نویسندگان
چکیده
Coordination of multiple behaviors independently obtained by a reinforcement learning method is one of the issues in order for the method to be scaled to larger and more complex robot learning tasks. Direct combination of all the state spaces for individual modules (subtasks) needs enormous learning time, and it causes hidden states. This paper presents a method of modular learning which coordinates multiple behaviors taking account of a trade-o between learning time and performance. First, in order to reduce the learning time the whole state space is classi ed into two categories based on the action values separately obtained by Q learning: the area where one of the learned behaviors is directly applicable (no more learning area), and the area where learning is necessary due to the competition of multiple behaviors (re-learning area). Second, hidden states are detected by model tting to the learned action values based on the information criterion. Finally, the initial action values in the relearning area are adjusted so that they can be consistent with the values in the no more learning area. The method is applied to one to one soccer playing robots. Computer simulation and real robot experiments are given to show the validity of the proposed method.
منابع مشابه
Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملClay: Integrating Motor Schemas and Reinforcement Learning 1 Background and Related Work 1.1 Motor Schemas
Clay is an evolutionary architecture for autonomous robots that integrates motor schema-based control and reinforcement learning. Robots utilizing Clay beneet from the real-time performance of motor schemas in continuous and dynamic environments while taking advantage of adaptive reinforcement learning. Clay coordinates assemblages (groups of motor schemas) using embedded reinforcement learning...
متن کاملA Distributed Adaptive Control Architecture for Autonomous Agents
Recently considerable interest in behavior-based robots has been generated by industrial, space and defence related activities. Such independent robots are envisioned to perform tasks where safety or economic factors prevent direct human control and communication difficulties prevent easy remote control. Although many successes have been reported using behavior-based robots with prespecified sk...
متن کاملTeaching Robots to Coordinate their Behaviours
Behaviour co-ordination is one of the major problems in behaviour-based robotics. This paper presents a teaching method for mobile robots to learn the behaviour coordination. In this method, the sensory information is abstracted into a limited number of feature states that correspond to physical events in the interactive process between a robot and its environment. The continuous motor actions ...
متن کاملCollaborative control in a humanoid dynamic task
This paper describes a collaborative control scheme that governs the dynamic behavior of an articulated mobile robot with several degrees of freedom (DOF) and redundancies. These types of robots need a high level of coordination between the motors performance to complete their motions. In the employed scheme, the actuators involved in a specific task share information, computing integrated cont...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996